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Abstract 

Recent studies have revealed that a small non-coding RNA, microRNA (miRNA) down-regulates its mRNA targets. This effect 
is regarded as an important role in various biological processes. Many studies have been devoted to predicting miRNA- 
target interactions. These studies indicate that the interactions may only be functional in some specific tissues, which 
depend on the characteristics of an miRNA. No systematic methods have been established in the literature to investigate 
the correlation between miRNA-target interactions and tissue specificity through microarray data. In this study, we propose 
a method to investigate miRNA-target interaction-supported tissues, which is based on experimentally validated miRNA- 
target interactions. The tissue specificity results by our method are in accordance with the experimental results in the 
literature. 
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Introduction 

MicroRNA is a short non-coding RNA that is approximately 
22 nt, which suppresses gene expressions via translational 
suppression or mRNA degradation by binding to 3 '-untranslated 
regions (3'UTR). The discovery of the first miRNA from 
Caenorhabditis elegans in 1993 inspired a wide variety of miRNA 
studies [1]. At present, approximately 21,264 miRNAs have been 
discovered in many species. 

To study the regulation between miRNAs and genes, miRNA 
target sites are usually predicted by miRNA target prediction tools. 
Many computational target prediction tools, such as MiRanda 
[2] ,TargetScanS [3-5] and RNAhybrid [6] , have been developed. 
In addition, several statistical methods have also been applied to 
build a network of associations between miRNAs and their target 
mRNAs [7-10]. Usually, miRNA target prediction tools predict 
many potential target sites. To reduce the number of false-positive 
target sites, predicted miRNA target sites should be confirmed by 
experiments. Generally, miRNA-target interactions (MTIs) can be 
confirmed by reporter assays, Western blot, microarray experi- 
ments, pSILAC or qRT-PCR. Moreover, many databases, such as 
miRTarBase [11], TarBase [12], miRecords [13] and miR2Di- 
sease [14], have been designed for storing experimentally validated 



MTIs. In particular, the miRTarBase (version 2.1) database has 
collected approximately 3,500 manually curated experimentally 
validated MTIs, including 657 miRNAs and 2,297 target genes 
among 17 species from 985 research articles [11]. The TarBase 
5.0 database has stored approximately 514 MTIs that were 
extracted from 203 papers [12]. The miRecord database, which 
includes 1,529 experimental interactions, is composed of exper- 
imentally validated miRNAs and predicted MTIs [13]. The 
miR2Disease database is aimed at storing experimentally validated 
MTIs, which are deregulated in various human diseases [14]. 
Among these databases, miRTarBase provides more updated 
MTIs than the other databases. 

Importandy, miRNAs have been observed to be tissue-specific 
in many studies. For example, miR-122 can only be detected in 
liver tissues and is undetectable in all other tissues [15]; the 
expression of miR-122 in hepatocellular carcinoma is relatively 
lower than it in healthy liver [16]; miR-1 and miR-143 are 
preferentially expressed in heart and colon tissues, respectively 
[15,17]; miR-126 is an endothelial-specific miRNA that regulates 
vascular integrity and angiogenesis [18]; miR-195 and miR-200c 
are specifically expressed in lung tissues [19]. In addition, some 
miRNAs are biomarkers for detecting cancers, such as miR-221, - 
100, -125b and -21 in pancreatic cancer [20]. 
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Although many researchers have indicated that many miRNAs 
have tissue-specific expression [15-20], no systematic methods 
have been established in the literature to investigate the highly 
negative correlations between an miRNA and its target genes in a 
group of specific tissues through microarray data. Analyzing the 
correlation of expressions between miRNAs and mRNAs is one of 
the methods that has been applied for increasing the confidence of 
predicted miRNA target sites [21-28]. In this study, we have 
developed a statistical method to determine microRNA-target 
interaction-supported tissues (MTI-supported tissues) based on 
experimentally validated miRNA-target interactions. The MTI- 
supported tissues of an miRNA is a group of tissues that this 
miRNA and its targets express in these tissues. 

The major aim of this study is to investigate the MTI-supported 
tissues that are based on experimentally validated miRNA-target 
interactions in the miRTarBase database. At http://tsmti.mbc. 
nctu.edu.tw, we briefly describe how the proposed method is 
applied to identify MTI-supported tissues. The major analytical 
results and the materials in this paper are presented on this 
website. 

Materials and Methods 

The conception of the proposed procedure is briefly shown in 
Figure 1. We use data sets [29] to illustrate our methods. This data 
set includes the miRNAs expression profiles and mRNA expres- 
sion profiles for 89 samples of 1 1 organs from tumor or normal 
tissues. The samples of 11 organs are summarized in Table 1, 
including 68 tumor tissues and 21 normal tissues. The mRNA 
expression profiles that were published in 2005 consist of two 
microarray platforms, GPL80 and GPL98, which represent 
16,063 genes across 89 tissues [29]. Because partial mRNA 
expression levels are missing, only 12,766 of these genes are used 
in our studies. The miRNA expression profiles (GSE2564) are 
composed of the expression data for 288 miRNAs across 249 
tissues [29]. After eliminating duplicate and redundant data, we 
only use data for 163 miRNA across 89 tissues. With the data for 
an miRNA, we intend to investigate the MTI-supported tissues to 
determine whether the miRNA is functional across these tissues 
based on experimentally validated MTIs. As mentioned in the 
Introduction, the miRTarBase database is more informative than 
other databases. We apply the miRTarBase version 2.1 database 
[11] to obtain experimentally validated MTIs, which can be 
accessed at "http:/ /mirtarbase.mbc.nctu.edu.tw/cache/download 
/2. l/miRTarBase_MTI.xls". According to the miRNAs that were 
recorded in GSE2564, we select 743 experimentally validated 
MTIs and analyze the correlations between miRNA and mRNA 
expression profiles across different tissue sets. 

Before analyzing the expression data, we first normalize the 
miRNA and mRNA expression data across 89 tissues. The data 
pre-processing method is provided in the Supplementary Data. 
Some data pre-processing results are presented in Figures SI and 
S2 in File SI. After the data pre-processing, the top 23 miRNAs 
with a target number that is greater than or equal to 1 0 and with a 
number of tissues with expression levels that were greater than 
7.25 are selected and are listed in Table 2. The expression level of 
7.25 is the cutoff point that was used in Huang et al. (2007) [7]. In 
Table 2, four miRNAs, hsa-miR-122, hsa-miR-124, hsa-miR-155 
and hsa-miR-133a, are ignored because there are not enough 
samples for these miRNAs that could be utilized for the analysis. 
Therefore, we analyze 19 miRNAs in this study. 

Before introducing the method, we first describe the motivation 
for proposing the method. Previous studies have shown that 
miRNAs down-regulate their targets [16,30], which results in a 



negative correlation of the microarray expressions between an 
miRNA and its target interactions. However, Figure SI in File SI 
reveals that the absolute values of most correlations of the miRNAs 
and their target interactions across all 89 tissues are not 
significantly large. We, therefore, conclude that the down- 
regulation of an miRNA occurs in some tissues. 

For an miRNA, we first find the associated interactions through 
the microarray dataset and the miRTarBase database and then 
calculate the correlations between the miRNAs and their targets. 
Thus, an miRNA is associated with a correlation set, which 
includes the correlations between this miRNA and its targets. 
Because there is more than one target interaction that is associated 
with an miRNA, our goal is to integrate several correlation 
coefficients between this miRNA and their targets to find MTI- 
supported tissues. Therefore, to determine the MTI-supported 
tissues of this miRNA, we propose using a criterion that is based 
on the two factors in the correlation sets: (i) the average correlation 
coefficient and (ii) the proportion of negative correlation coeffi- 
cients. Due to the down-regulation of the miRNA to its target 
interaction, for a set of true MTI-supported tissues of an miRNA, 
we expect that the expression data between this miRNA and its 
target interactions would be highly negatively correlated. There- 
fore, we expect that the true MTI-supported tissues should satisfy 
the hypothesis that the average correlation coefficients are strongly 
negative and that the proportion of negative correlation coeffi- 
cients is large. 

To describe the proposed method, we first introduce some 
notations. We denote the 89 tissues as t\,...,t%$ and the 19 
miRNAs as m\,...,m\g. The expression value of an miRNA m and 
an mRNA y across the 89 tissues is denoted as (m 1 ,....,m 89 ) and 
(y x ,...,y m ). Let s(m) denote the target interaction (mRNA) 
number that corresponds to this miRNA from the miRTarBase 
database. 

Let A = {t Kl ,...t Kt }, Ki<...<K/,, be a set of A tissues of the 89 
tissues, where h=\A\ is the size of sample set A. The correlation 
coefficient of the miRNA m and the mRNAj; across a sample set A 
is defined as 

... Eti(^--/v)(y' K --/V> 

r m (A) = y 

where fi m and ji y denote the means of m andjy across the tissues in 
the set A, respectively. 

Let r{ mt i}(A),...,r{ m j( m ' ) }(A) denote the correlation coefficients 
of this miRNA and its target interactions across the tissues in set A, 
and let r m (A)= £^=i r^ m ^(A)/s{m) denote the average of these 
correlation coefficients across the tissue set A. In addition, let w neg 
be the number of negative value of the s{m) correlation 
coefficients. Then, we define the negative correlation coefficient 
proportion as. 

Wneg 

x ' s < s(m) 

For the miRNA m, the goal of this study is to find a tissue set O 
such that f m (0) is strongly negative. In addition, because there are 
s(tn) correlation coefficients for this mRNA, we must require that 
the proportion of negative correlation coefficients among these 
s(m) correlation coefficients is greater than a threshold. That is, we 
intend to find a tissue set such that 1 — \V{ nuneg } is small. Thus, we 
propose using the loss function. 
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Figure 1. Tissue selection pipeline for finding MTI-supported tissues of miR-17 based on experimentally validated MTIs. 

Experimentally validated MTIs sharing the same miRNA were selected from the miRTarBase database. The correlations of MTIs across a combination 
of tissues, which were selected by our method, are strongly negative. 
doi:1 0.1 371 /journal.pone.0095697.g001 



In the loss function (1), a is used to adjust the weights of r m (0) and 
L(0,m) = af m (0) + (l-a)(l- W{nuneg] ), (1) 1 .""Wl- T ° ^ O* satisfying the condition (2), it is difficult to 

directly calculate the L(0,tn) value for all sets of O. For a set O 
to select a tissue set O* , such that the minimum of the loss function with h elements, there are C® 9 combinations. Because the range of 
occurs at O* , where a is a constant between 0 and 1, that is, the h value is from 2 to 89, there is a total of 



L(0*,m)= min 0 L(0,m). (2) 



C — d*2 ~\~ -•• ~\~ ^-89 

possible selections of a set O. The calculation complexity is too 



Table 1. Tumor tissues and normal tissues. 



Tissues 


Tumor 


Normal 


Colon 


7 


4 


Uterus 


10 


1 


Prostate 


6 


6 


Pancreatic 


8 


1 


Ovary 


5 


0 


Lung 


5 


2 


Breast 


6 


3 


Bladder 


6 


1 


Mesothelioma 


8 


0 


melanoma 


3 


0 


Kidney 


4 


3 


Total 


68 


21 



doi:1 0.1 371 /joumal.pone.0095697.t001 
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miRNA > mRNA 1 ,mRNA 2 , ...,mRNA„ 

across all tissues 



Select a number ft 



Randomly select 10 5 sample sets 
with ft tissues in the sample set. 



For each tissue set, we calculate 
the correlations between the 
miRNA and its targets. Then 
average these correlations 



For each tissue set, we 
calculate the proportion of 
negative correlations 



Totally, we have the means 
of correlations, 
r v r 2 ,...,r w s 



Totally, we have the 
proportions of negative 
correlations, 



Loss function algorithm 



Calculate Z£, Z£° 
[\ — ar 1 + (1 — a)w 1 



l h° S = « r io 5 + C 1 " a)w 10 s 



V h denotes the smallest l\ 



0* h is the corresponding 
tissue set of l* h 
Find MTI-supported tissues 
0* h for the tissue number ft 



t 


t 


... oi 5 

t 


1*2 


, 11, 


...,i; s 



Select for different tissue numbers ft 
and repeat the procedure again 



The tissue set 0- with the smallest l\ 



MTI-supported tissues 



Figure 2. The flowchart of the proposed algorithm for finding MTI-supported tissues. 

doi:10.1371/journal.pone.0095697.g002 



high to obtain the true O* . Because the total possible combination 
c is too large, we may slightly relax the condition (2) to be 



L(0**,m)= min oesL(0,m) 



(3) 



where S is a set of 0, which is restricted to have d elements instead 
of c elements, where c' < c. In the following analysis, c' is selected 



to be 10 5 . To select an adequate d value, we had tested many 
different d values and found that the selected tissue of using 
c'=10 5 is almost the same with the selected tissues of using 
d> 10 s for many cases. Thus, we adopt c'= 10 s in this analysis. 

When sampling tissue sets, a heuristic method or a brute-force 
method can be adopted. If highly confident MTI-supported tissues 
for an miRNA are available from the literature or other resources, 



PLOS ONE | www.plosone.org 



5 



April 2014 | Volume 9 | Issue 4 | e95697 



microRNA-Target Interaction-Supported Tissues 



0.10 




-»-Top5 - Top 5 MTI-supported tissues 
-»-Top8 - Top 8 MTI-supported tissues 

Topll - Top 11 MTI-supported tissues 
-*-Topl5 - Top 15 MTI-supported tissues 



Figure 3. The loss of function values for different numbers of MTI-supported tissues. 

doi:1 0.1 371 /journal.pone.0095697.g003 

tissue set O** corresponding to this l g value is the MTI-supported 
tissues set. 

In addition to only considering the correlations of the 
expressions between miRNAs and mRNAs, DNA copy-number 
and promoter methylation at the mRNA gene locus on mRNA 
expression may influence the miRNA-mRNA expression associ- 
ation. We have provided R-codes which include the other factors 
in the loss function. The readers can access the R codes at http:// 
tsmti.mbc.nctu.edu.tw/lossfunction2.txt. 

Because the algorithm is based on the loss function (1) to 
evaluate the performance of the correlation, we called this 
algorithm the correlation loss function algorithm. The steps of 
our algorithm are described in the flowchart in Figure 2. 

Practically, for an miRNA and its targets, we are not sure how 
many tissues should be selected. First, we begin by selecting 3 
tissues from all tissues but do not select one tissue because the 
correlation between an miRNA and its targets across one tissue 
cannot be calculated. To find the optimal tissue number of 
different miRNAs, we select the tissue number from 3 to 15. 

In Figure 3, we illustrate the loss of function values with a= 1/4 
of each miRNA and its targets across 5 tissues, 8 tissues, 1 1 tissues 
and 15 tissues, respectively. From the calculation results, we find 
that the loss function value across more than 15 tissues is larger 
than the loss function value across less than 15 tissues. Therefore, 
we do not consider the case with tissue number that is greater than 
15, and only chose 3 to 15 tissue numbers to find the optimal tissue 
number. The results indicate that the optimal tissue number that is 
derived by the algorithm depends on the miRNAs. 

This paper mainly presents the results when a = 1 /4. We have 
used other a values such as a = 1 /3 in the loss function to select 
tissues. The results for a =1/3 are similar to those for a=l/4. 
Because a is used to adjust the weight between the mean of the 
correlations f m {0) and the proportion of positive correlations 1- 
w {m,neg}, an d we are more concern the proportion of positive 
correlations, we put more weight on the second term. In this study, 
the results using a =1/4 lead to good performance. Therefore, 



we suggest directly choosing tissue sets including these tissues in 
implementing the algorithm, which can prune the search space. 
Otherwise, the brute-force sampling is suggested to avoid 
obtaining not objective results. Since the suggested sampling size 
is c' = 1 0 5 , it is not very time-consuming in implementing the 
algorithm when we use the brute-force sampling method. 

The steps of finding the O** under condition (3) is presented in 
the following algorithm. Before preceding the algorithm, we must 
specify the constant a in condition (1). 

To evaluate the performance of our proposed algorithm, we 
adopt a permutation test and a clustering analysis [31] to show the 
superiority of the proposed algorithm. Both methods are described 
in the Supplementary Data. Some comparison results are 
presented in Figure S3 in File SI and Table SI in File SI. The 
results of the two methods uphold our proposed algorithm as an 
efficient approach to select valid MTI-supported tissues of an 
miRNA. 

2.1 Algorithms 

Algorithm for tissue prediction: correlation loss function 
algorithm. Suppose that an miRNA m has s(m) MTIs. 

Stepl: Randomly select a set 0 with h tissues. 

Step2: Calculate the correlations between this miRNA and the 
s(m) mRNAs across the tissues in 0. Then, calculate the mean of 
these correlations, f m (0) and the proportion of negative correla- 
tions, U'{ m „ eg y 

Step3: Use the values r m (0) and wi mneg ] to calculate (1). 
Step4: Repeat Steps 1-3 10 s times. We obtain 10 s L(0,m) 
values. 

Step5: Find out the minimum value of the 10 5 values, say //,. 
List the tissue set, say O/,, corresponding to this //, value, that is 
lh=L(O h ,m). 

Step6: Repeat Steps 1-5 for different h to obtain different h 
values. Find the minimum value of these 4s, say L. Then, the 
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All MTIs (89 tissues) 

hsa-mir-1 7 (Top 6 MTI-supported tissues) 



10 
C 

CD 

Q 



CO - 



CN 




-1.0 



-0.5 



0.0 



0.5 



1.0 




HSA-MIR-17 
0.43 TGFBR2 
-0.05 NCOA3 
-0.05 VEGFA 
-0.15 RUNX1 
-0.32 YES1 
-0.33 PKD2 
-0.37 CDKN1 A 
-0.37GPR137B 
-0.42 CCND1 
-0.5 JAK1 
-0.51 TCEAL1 
-0.53 BCL2L1 1 
-0.58 NPAT 
-0.63 ZNFX1 
-0.64 CCL1 
-0.69 DNAJC27 
-0.7 BCL2 
-0.71 PTEN 
-0.73 PTPRO 
-0.74 MAPK9 
-0.76 MAP3K12 
-0.81 E2F1 
-0.87 BMPR2 
-0.89 MEF2D 



Figure 4. The correlation density plot and heatmap for experimentally validated MTIs (miRTarBase). a. Comparison of correlation 
densities for all 743 experimentally validated MTIs (solid line) and hsa-miR-17 with 24 targets across the top 6 MTI-supported tissues (dashed line), b. 
The expression profiles of hsa-miR-17 and 24 targets across the top 6 MTI-supported tissues. The correlations between the expression profile of hsa- 
miR-17 and the profiles of target genes were annotated next to the gene symbol. Green represented low expression and red represented high 
expression in corresponding genes and tissues. 
doi:1 0.1 371 /journal.pone.0095697.g004 



a = 1/4 is a suggested value in using this algorithm. For other real 
applications of this algorithm, the readers can use training data to 
obtain a suitable a value. 

In addition, to make a more objective analysis in selecting the 
MTI-supported tissues, instead of selecting a tissue set among 
tissue sets for /z = 3 to 15 which minimizes the loss function (1), we 
proposed a method to rank tissues by the following two steps. The 
first step is to find the 13 tissue sets which minimize the loss 
function corresponding to h = 3 to 15, respectively. The second 
step is to rank the tissues appeared in these 1 3 tissue sets according 
to their occurrence numbers among the 13 tissue sets. The tissue 
with the most occurrence frequency is ranked first and so on. The 



CD 
D 



■ AJi 743 MTIs (39 issues) 
hsa-mir-1 7 valti 24 targets (Top 6 MTI-supported Issues) 
hsa-mr-17 witri24 targets (19 extended issues from Top 6 MTI-supported tissi 
hsa-mir-17 wrfli95 predicted targets from TargetScansfTop 6 MTI-supported St 




-1.0 



-0.5 



0.0 



0.5 



1.0 



Figure 5. The correlation density plot. Comparison of correlation 
densities for all 743 experimentally validated MTIs across all 89 tissues 
(black solid line), hsa-miR-17 with 24 targets across the top 6 MTI- 
supported tissues (red dashed line), hsa-miR-17 with 24 targets across 
19 extended tissues (green dashed line) and hsa-miR-17 with 95 
predicted targets across the top 6 MTI-supported tissues (blue dashed 
line). 

doi:1 0.1 371 /journal.pone.0095697.g005 



ranking results are presented in Table 3, which can provide 
information on the significance of the MTI-supported tissues. 

Results 

Using the algorithm, we obtain MTI-supported tissues for 19 
miRNAs that are listed in Table 2. In the following, we use hsa- 
miR-17 as an example to describe the analysis result. Figure 4 
presents the plots for the density function of correlation and the 
heatmap of hsa-miR-17. Figure 4(a) shows the density plot of 
correlations (solid line) of all 743 experimentally validated MTIs 
across all 89 tissues and the density plot of correlations (dashed 
line) between hsa-miR-17 and its targets across the top 6 MTI- 
supported tissues. The solid line is symmetric and centralized to 
zero; however, the dashed line is right-skewed. Apparendy, the 
right-skewed density plot of the correlations between hsa-miR- 1 7 
and its targets across the top 6 MTI-supported tissues shows that 
most correlations are negative, which is in accordance with the 
degradation behavior between an miRNA and its targets. In 
contrast, the fact that the correlations of all 743 experimentally 
validated MTIs across all 89 tissues are near zero indicates that the 
down-regulation behavior of an miRNA with its targets only 
displays across the MTI-supported tissues. 

Figure 4(b) is an example that shows that the expression profiles 
of most hsa-miR- 1 7 target genes are negatively correlated with the 
expression profile of hsa-miR- 1 7 . In Figure 4(b), the expression 
profiles of hsa-miR- 1 7 and 24 target genes across the top 6 MTI- 
supported tissues are presented. The correlation between the 
expression profiles of hsa-miR-17 and each target gene is 
annotated next to the gene symbol in Figure 4(b). Most of 
expression profiles of hsa-miR-17 target genes are negatively 
correlated with the expression profile of hsa-miR-17. The 
expression profiles of the hsa-miR-17 target gene TGFBR2 is 
positively correlated with the miRNA expression profile. Although 
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b. 



hsa-miR-17 
0.23 CDKN1 A 
0.13TGFBR2 
0.11 NCOA3 
0.11 GPR137B 
0.1 PTEN 
0.04 RUNX1 
-0.05 ZNFX1 
-0.05 VEGFA 
-0.14 MEF2D 
-0.14 PKD2 
-0.15 YES1 
-0.15 BCL2L11 
-0.16CCND1 
-0.24 PTPRO 
-0.24 JAK1 
-0.27 MAP3K12 
-0.33 BCL2 
-0.34 NPAT 
-0.37 CCL1 
-0.39 TCEAL1 
-0.41 MAPK9 
-0.42 BMPR2 
-0.46 E2F1 
-0.47 DNAJC27 
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Figure 6. The heatmaps of hsa-miR-17 expression and regulated genes, a. The expression profiles of hsa-miR-17 and 24 targets across 19 
extended tissues. The correlations between the expression profiles of hsa-miR-17 and target genes were annotated next to the gene symbols. Green 
represented low expression and red represented high expression in corresponding genes and tissues, b. Part of the expression profile of hsa-miR-17 
and experimentally validated and predicted target genes. There are 20 negatively correlated target genes; 16 of these target genes are predicted to 
be hsa-miR-17 target genes. PTEN, MAPK9, MAP3K12 and BMPR2 are experimentally confirmed target genes. 
doi:10.1371/journal.pone.0095697.g006 



it has been experimentally validated that these target genes could 
be inhibited by hsa-miR-17, TGFBR2 is not. The reason for the 
positive correlation between hsa-miR- 1 7 and these genes could be 
that these genes are regulated by other stronger regulatory factors. 
To support our suspicion, we re-examine the studies of hsa-miR- 
1 7 regulation on TGFBR2, which shows that TGFBR2 expression 
could not be detected because of microsatellite-instability and 
mutations [32-34]. 

In addition, the correlation density plots of other 1 8 miRNAs 
and their targets (At http://tsmti.mbc.nctu.edu.tw) reveal similar 
results to that of hsa-miR-17. For example, the proposed 
algorithm searches out 5 top specific tissues for hsa-miR-21, 
which has the largest target number (43 targets) among the 
miRNAs that have been considered in this study. In addition, we 
also find the top 8 MTI-supported tissues for hsa-let-7a and its 
targets. Both of the correlation density plots across MTI-supported 
tissues are right-skewed and are greater than those plots across all 
89 tissues. In addition to the correlation density results, we observe 
that the selected tissue number depends on the miRNA. In the 
Methods section, we show that the optimal tissue number, which is 
derived by the algorithm, is dependent on the miRNAs. Due to the 
limited space, we only provide the elaboration on the example 
miR-17 in details. For other miRNAs, we calculate average 
correlation between an miRNA and its targets across all 89 tissues 
and average correlation between an miRNA and its targets across 
selected MTI-supported tissues. Then we use the value of the 
second average correlation minus the first average correlation as a 
metric to quantify the quality improvement. The results are 
presented in Table 2. The values are range from —0.32 to —0.68. 
The value for miR-17 is —0.536. It reveals that average 
correlation between an miRNA and its targets across selected 
MTI-supported tissues are more strongly negative correlated than 
the average correlation between an miRNA and its targets across 
all 89 tissues. 



Because the top 6 MTI-supported tissues that have been 
selected for hsa-miR-17 include tumor tissues, which may be 
queried with extreme expression levels, we extend the top 6 MTI- 
supported tissues to other tissues with the same organ as the top 6 
MTI-supported tissues. The top 6 MTI-supported tissues for hsa- 
miR-17 include tumor tissues and normal tissues. Tumor tissues 
are composed of prostate, breast and uterus tissues. Normal tissues 
are composed of breast and uterus tissues. The number of tissues is 
extended to 24 because the total number of tumor prostate tissues, 
tumor/normal breast tissues and tumor/normal uterus tissues is 
24. Here, we again examine the expression levels of 24 tissues of 
hsa-mir-1 7. Only 19 expression levels are larger than 7.25. A tissue 
is eliminated if its miRNA expression level that corresponds to the 
tissue is less than 7.25, which is the cutoff point that was used in 
Huang et al. [7] . Figure 5 shows the correlation density plot (green 
dashed line) for the extended results. Comparing Figure 4(a) with 
Figure 5, we add a green dashed line in Figure 5, which is the 
correlation density plot of hsa-miR- 1 7 and its targets across the 1 9 
extended tissues. The green dashed line is always between the 
black solid line and the red dashed line. The result clearly reveals 
that the analysis that is based on 6 MTI-supported tissues leads to 
the best result, followed by the extended tissues, which are better 
than the analysis that is based on 89 tissues. 

Furthermore, we also investigate the target prediction that is 
based on the selected tissues. By searching for the conserved 8mer 
and 7mer sites that match the seed region of each miRNA from 
the TargetScanS prediction tool [3-5], we have 95 predicted 
targets of hsa-miR- 1 7 from our dataset, which are based on the 
top 6 MTI-supported tissues. The blue dashed line in Figure 5 is 
the correlation density of hsa-miR-17 and 95 predicted targets 
across the top 6 MTI-supported tissues. Although a small portion 
of correlations are positive correlations, most of the correlations 
are more negative than that across all 89 tissues. Nevertheless, the 
correlation density of hsa-miR- 1 7 and 95 predicted targets across 
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the top 6 MTI-supported tissues (blue dashed line) does not have a 
better performance than that obtained using 24 experimentally 
validated MTIs across the top 6 MTI-supported tissues (red 
dashed line) and that obtained using 24 experimentally validated 
MTIs across 19 extended tissues (green dashed line). Adopting 95 
predicted targets of hsa-miR- 1 7 across the top 6 MTI-supported 
tissues can still improve the correlations that are not near zero. By 
the extended case and the predicted case, it is concluded that the 
proposed algorithm is reliable for selecting MTI-supported tissues. 

Discussion 

Our approach is focused on discovering MTI-supported tissues 
for an miRNA based on experimentally validated target genes. 
However, we find some mRNAs are not down-regulated by their 
experimentally validated miRNA. Several potential reasons are 
described. First, mRNA expression can be regulated by multiple 
factors including DNA copy number, transcriptional regulation 
and post-transcriptional regulation. Since our approach selects a 
group of different tissues, miRNA repression ability could be 
smaller than other regulation mechanisms in part of selected 
tissues. Second, some miRNAs not only down-regulate their target 
genes, but also up-regulate their target genes [35]. miRTarBase 
does not include any information about an miRNA up-regulates or 
down-regulates its target genes. It is supposed that the miRNA in 
miRTarBase can only down -regulate its target genes. Neverthe- 
less, many miRNAs have been reported that miRNAs can up- 
regulate their target genes [35-38]. For example, the record of 
MIRT004506 in miRTarBase is that miR-4661 up-regulates IL-10 
via binding AU-rich region in 3'UTR [36]. Therefore, some up- 
regulation phenomenons are discovered from the correlation 
density plots across the MTI-supported tissues. 

From the correlation density plots across the MTI-supported 
tissues in Table 2, we find that several experimentally validated 
MTIs are positively correlated with the miRNA expression 
profiles. First, miR-145 is positively correlated to the target gene 
IRS 1 with a correlation of 0.55. A previous report shows that miR- 
145 can down-regulate the protein level of IRS 1 but cannot down- 
regulate the mRNA of IRS1 [39]. Therefore, the negative 
correlation between miR-145 and IRS1 is not observed. Further- 
more, some cell lines, such as BCT-20, do not express IRS1. Thus, 
the down-regulation of IRS 1 by miR-145 cannot be observed [40]. 
The second non-negatively correlated MTI is miR- 1 and SERP 1 . 
The study, which does examine the interaction between miR-1 
and SERP 1 , shows that the down-regulation of SERP 1 by miR- 1 
is not significant [41]. The miRTarBase database might record 
many experimentally validated MTIs whose target genes are 
significandy down-regulated by the corresponding miRNAs. It is 
difficult to determine the reason for some positively correlated 
experimentally validated MTIs, such as the experimentally 
validated MTI between miR-1 and FOXP1. More advanced 
experiments are required to explore these MTIs. 

Figure 6 (a) is an extended illustration of Figure 4. We collect 
expression profiles, which are sampled from the same organs that 
are listed in Figure 4. We observe more conflicted correlations in 
experimentally validated MTIs. First, the correlation between the 
expression of CDFvNIA and hsa-miR-17 is 0.23, and the 
correlation between the expression of RUNX1 and hsa-miR-17 
is 0.04. Previous studies have revealed that CDKN1A and 
RUNX1 are regulated by multiple miRNAs [42,43]. Due to our 
approach, which only observes one miRNA and its corresponding 
target genes, the expression of CDKN1A and RUNX1 could not 
simply negatively correlate to the expression of hsa-miR-17. Two 
conflicted studies show that hsa-miR- 1 7 can or cannot inhibit the 



translation of PTEN. Olive et al. (2009) concluded that PTEN 
could be repressed by miR- 19, but not by hsa-miR-17, in B-cell 
lymphoma [44] . The experiment that was reported by Trompeter 
et al. (2007) shows that PTEN is significandy repressed by hsa- 
miR-17 in HEK293T cells and kidney cells [45]. NCOA3 (AIB1) 
is not correlated with the expression of hsa-miR-17. The 
expression correlations are negative in breast tissues and are in 
agreement with the study that demonstrated that NCOA3 is 
down-regulated by hsa-miR- 1 7 [46] . However, these interactions 
are not negatively correlated with the rest of the tissues. This 
phenomenon indicates that an miRNA-target interaction is MTI- 
supported tissue-specific and reveals that our approach might 
include tissues whose experimentally validated MTIs are not 
functional. Because the seed region of miR- 17, miR- 106a and 
miR-20a are identical, the expression correlation of hsa-miR-17 
and its target genes might be affected by other miRNAs. Thus, our 
approach can determine which target genes are dominandy 
regulated by hsa-miR- 1 7 and find the non-MTI-supported tissues. 

Currently, many studies use differentially expressed genes to 
reduce false-positive miRNA target sites [22,25,47]. We also apply 
our approach to the predicted miRNA target genes. In Figure 6 
(b), 16 predicted target genes have a highly negative correlation 
with the expression of the miRNA. 

In addition, we present the selected tissues of miRNAs and their 
targets in Table 4. Table 4 shows that three of five MTI-supported 
tissues of hsa-miR-34a are prostate tissues; four of five MTI- 
supported tissues of hsa-miR- 106a are colon tissues; three of five 
MTI-supported tissues of hsa-miR-222 are uterus tissues; three of 
five MTI-supported tissues of hsa-miR-26a are pancreatic tissues; 
and four of six MTI-supported tissues of hsa-miR- 141 are 
pancreatic tissues. These tissues account for a high proportion of 
selected MTI-supported tissues. Therefore, we conclude that each 
miRNA can down-regulate its experimentally validated target 
genes in the selected tissues. Although not all of selected tissues 
account for a high proportion of MTI-supported tissues, the 
correlations between an miRNA and its targets across selected 
tissues are highly negative. The individual selected tissues, which 
do not account for a high proportion of MTI-supported tissues, 
can also be suggested as potential MTI-supported tissues of the 
miRNA and its targets. 

Because the correlation results are not significant for explaining 
the down-regulation between an miRNA and its targets, we are 
highly interested in these results, particularly because the results 
conflict with our expectation. The results of our proposed 
algorithm support that experimentally validated MTIs are only 
functional in MTI-supported tissues. In addition, we also provide 
two methods to verify the tissue-specificity; the results uphold our 
proposed algorithm as an efficient approach to select valid MTI- 
supported tissues of an miRNA. Our approach uses publicly 
available microarray data to select the predicted target genes, 
which are highly negatively correlated with the miRNA expression 
profiles. The benefit of our approach is that the experimental cost 
could be reduced; however, the drawback is that the expression 
profiles of the experimental target might differ from our collected 
expression profiles. This drawback can be improved by increasing 
the expression profiles from all type of tissues. Once the numbers 
of miRNA and mRNA expression profiles are increased, our 
approach might provide better performance for selecting potential 
experimentally validated MTIs from the predicted MTIs, and the 
MTI validation could be accelerated by our analysis. 
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Supporting Information 

File SI Figure S1-S3. Figure SI. Summary of the correlations 
of 743 MTIs (miRTarBase) across all 89 tissues, a. The proportion 
of negative correlations is near 50%. b. The amount of 
correlations between —0.5 and 0.5 is larger than 90%. Both 
results reveal that there are no significant correlations among 
MTIs using this data source across all 89 tissues. Figure S2. The 
expression levels of hsa-miR-122 and hsa-let-7a in 89 tissues The 
red line is a cutoff line, and the cutoff value is 7.25 (after log 2 
transformed). The data were eliminated when the expression level 
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